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Abstract. We present a novel approach to compute efficiently all the extreme elements of tropical 
polyhedra, which are the analogues of convex polyhedra in the max-plus algebra. It relies on a 
method successively eliminating the inequalities, and in which redundant generators are removed 
a priori. The cornerstone of this method is a new combinatorial characterization of the extreme 
elements of tropical polyhedra defined by inequalities: we prove that the extremality of an element 
amounts to the existence of a strongly connected component reachable from any other in a directed 
hypergraph. We show that the latter property can be checked in almost linear time in the size 
of the hypergraph. We give theoretical bounds and experimental results on various benchmarks, 
including instances arising from an application to static analysis, showing that our algorithm is 
faster than the previous ones by several orders of magnitude. 
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1 Introduction 



Tropical polyhedra are the analogues of convex polyhedra in tropical algebra. The latter deals 
with structures like the max-plus semiring M max (also called max-plus ulcjcbrcL^ , which is the set 
RU{ — oo}, equipped with the addition x®y := max(i, j/) and the multiplication x®y := x + y. 

The study of the analogues of convex sets in tropical or max-plus algebra is an active research 
topic, and has been treated in various guises. It arose in the work of Zimmerman [1], following 
a way opened by Vorobyev [2] , motivated by optimization theory. Max-plus cones were studied 
by Cuninghame-Green [3]. Their theory was independently developed by Litvinov, Maslov and 
Shpiz [4] (see also [5]) with motivations from variations calculus and asymptotic analysis, and 
by Cohen, Gaubert, and Quadrat [6, 7] who initiated a "geometric approach" of discrete event 
systems [8], further developed in [9, 10]. In [11, 12], Singer recasted this theory in the setting of 
abstract convexity [13]. Briec and Horvath [14] had similar motivations. The field has attracted 
recently much attention after the work of Develin and Sturmfels [15], who pointed out important 
connections with tropical geometry and applications to phylogenetical analysis. They developed 
a new combinatorial point of view, thinking of tropical polyhedra as polyhedral complexes in 
the usual sense (see also the series of works by Joswig, Yu, and the same authors [16-19]). 

A tropical polyhedron can be represented in two different ways, either internally, in terms 
of extreme points and rays, or externally, in terms of linear inequalities (see Sect. 2 for details). 
As in the classical case, passing from the external description of a polyhedron to its internal 
description is a fundamental computational issue. This is the object of the present paper. 

Butkovic and Hegedus [20] gave an algorithm to compute the generators of a tropical poly- 
hedral cone described by linear inequalities, with motivations from economic problems. Gaubert 
gave a similar one and derived the equivalence between the internal and external representa- 
tions [21, Ch. Ill] (see also [22]). Both algorithms rely on a successive elimination of inequalities, 
but have the inconvenient of squaring at each step the number of candidate generators. Then, 
an elimination technique must be incorporated to eliminate the redundant candidates. A first 
implementation of these ideas was included in the Maxplus toolbox of Scilab [23]. Joswig de- 
veloped a different approach, implemented in Polymake [24], in which a tropical polytope is 
represented as a polyhedral complex [19]. However, the latter has many more generators. 

The present work grew out of an application to software verification by static analysis [25]. 
Tropical polyhedra are used to automatically discover complex relations involving the operators 
min and max which hold over the variables of a program. These relations allow to prove the 
absence of bugs in the program. The execution time of the static analyzer directly depends on 
the efficiency of the algorithm computing the extreme elements of tropical polyhedra. While 
the results were encouraging, all the previously developed algorithms were too slow to make the 
static analyzer usable in practice. Similar limitations were observed in other applications [9, 10]. 

Contributions. We develop a new algorithm which computes the extreme elements of tropical 
polyhedra. It is based on a successive elimination of inequalities, and a result (Th. 1) allowing 
one, given a polyhedron V and a tropical halfspace H, to construct a list of candidates for 
the generators ofVnH. The key ingredient is a combinatorial characterization of the extreme 
generators of a polyhedron defined externally (Th. 2 and 3): we reduce the verification of 
the extremality of a candidate to the existence of a strongly connected component reachable 
from any other in a directed hypergraph. We show that the latter problem can be solved very 
efficiently (Th. 4), a result which is of independent interest. We include a complexity analysis 
and experimental results (Sect. 6), showing that the new algorithm outperforms the earlier ones, 
allowing us to solve instances which were previously by far inaccessible. 

2 Tropical Polyhedra and Polyhedral Cones 

The neutral elements for the addition © and multiplication (g), i.e., the zero and the unit, will 
be denoted by := — oo and 1 := 0, respectively. The tropical analogues of the operations on 
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Fig. 2. Intersecting a cone with a halfspace 



vectors and matrices are denned naturally. The elements of H^ ax , the dth fold Cartesian product 
of M ma x, will be thought of as vectors, and denoted by bold symbols, like x = (a?i, . . . , xA. 

A tropical halfspace is a set of the vectors x = (xi) G Mf nax verifying an inequality constraint 
of the form maxi<j<d a« + X; t < maxi<j<d bi + Xi, with Oj, 6j G K^ ax . A tropical polyhedral cone 
is defined as the intersection of n halfspaces. It can be equivalently written as the set of the 
solutions of a system of inequality constraints Ax < Bx. Here, A = (ctjj) and B = (6^) are 
nxd matrices with entries in M max , concatenation denotes the matrix product (with the laws of 
K max )) and < denotes the standard partial ordering of vectors. For sake of readability, tropical 
polyhedral cones will be often referred to as polyhedral cones or cones. 

Tropical polyhedral cones are known to be generated by their extreme rays [26-28]. Recall 
that a ray is the set of scalar multiples of a non-zero vector u. It is extreme in a cone C if u G C 
and if u = v © w with v,w £ C implies that u = voru = w.A finite set G = (g t )iei of vectors 
is said to generate a polyhedral cone C if each g % belongs to C, and if every vector x of C can 
be written as a tropical linear combination ® i Xig 1 of the vectors of G (with Aj G M m ax)- The 
tropical analogue of the Minkowski theorem [27, 28] shows in particular that every generating 
set of a cone that is minimal for inclusion is obtained by selecting precisely one (non-zero) 
element in each extreme ray. Note that in tropical linear combinations, the requirement that A, 
be nonnegative is omitted. Indeed, = —00 < A holds for all scalar A G M max . 

A tropical polyhedron of M^ax i s the affine analogue of a tropical polyhedral cone. It is 
defined by a system of inequalities of the form Ax © c < Bx © d. It can be also expressed 
as the set of the tropical affine combinations of its generators. The latter are of the form 
ie/ Aji? J © 0j 6 j Hjri , where the (v l )i<=j are the extreme points, the (r^)j & j a set formed 
by one element of each extreme ray, and (J^ A, = 1. It is known [7,27] that every tropical 
polyhedron of M^ ax can be represented by a tropical polyhedral cone of Mf n + X thanks to an 
analogue of the homogenization method used in the classical case (see [29, Sect. 1.5]). Then, 
the extreme rays of the cone are in one-to-one correspondence with the extreme generators of 
the polyhedron. That is why, in the present paper, we will only state the main results for cones, 
leaving to the reader the derivation of the affine analogues, along the lines of [27]. 

In the sequel, we will illustrate our results on the polyhedral cone C given in Fig. 1, defined 
by the system in the right side. The left side is a representation of C in barycentric coordinates: 
each element (xi,X2,X3) is represented as a barycenter with weights (e xi , e X2 , e X3 ) of the three 
vertices of the outermost triangle. Then two elements of a same ray are represented by the same 
point. The cone C is depicted in solid gray (the black border is included), and is generated by 
the extreme elements g° = (0,0,0), g l = (-2,1,0), g 2 = (2,2,0), and g 3 = (0,0,0). 

3 Principle of the Algorithm 

Our algorithm is based on a successive elimination of inequalities. Given a polyhedral cone C 
defined by a system of n constraints, the algorithm computes by induction on k (0 < k < n) a 
generating set G& of the intermediate cone defined by the first k constraints. Then G n forms a 
generating set of the cone C. Passing from the set G^ to the set G^+i relies on a result which, 
given a polyhedral cone K, and a tropical halfspace 7i = { x \ ax < bx}, allows to build a 
generating set G' of /C PI H from a generating set G of /C: 



Theorem 1. Let K, be a polyhedral cone generated by a set G C I^max? an d % = { x I ax — t> x } 
a tropical half space (a,b G M^f j. T/ien i/ie polyhedral cone )Cn7i is generated by the set: 

{g <E G \ ag < bg} U { (ah)g © (&g)ft \ g,h <E G, ag < bg, and ah > bft }. 

As an example, consider the cone denned in Fig. 1 and the constraint xi < + (depicted 
in semi-transparent gray in Fig. 2). The three generators g 1 , g 2 , and g 3 satisfy the constraint, 
while g° does not. Their combinations are the elements ft, 1 ' , h 2 ' , and ft 3 ' respectively. 

Nevertheless, Th. 1, and subsequently the whole inductive approach, may return non-extreme 
elements. (In the example represented in Fig. 2, only ft. 1 ' is extreme, whereas ft 2 ' and ft 3,0 are 
not.) As mentioned above, the extreme rays of a cone form a minimal generating set, so that 
it is useless to compute non-extreme elements. On top of that, the latter considerably degrade 
the performance of the approach: at each inductive step, the number of generators, among 
which many can be non-extreme, grows quadratically in the worst case (because of the pairwise 
combinations in Th. 1 of the g and ft). Hence the complexity of the inductive method in total 
is 0{o9 n ) both in time and space, which is untractable. 

For that reason, the approach is improved by including the elimination of non-extreme 
elements at each step of the induction. This idea is implemented in the algorithm Compute- 
Extreme (Fig. 3), which returns one element of each extreme ray of the cone C. The argument 
n corresponds to the number of constraints of the system Ax < Bx. When it is equal to 0, 
the cone coincides with M max , so that it is generated by the tropical canonical basis (e*)i<i<d 
(where e\ = 1 and e % - = if j ^ i). Otherwise, the system is split into Cx < Dx formed by the 
(n — 1) first inequalities, and ax < bx by the last one. Then the elements provided by Th. 1 are 
computed from the set G of extreme elements of the intermediary cone V = { x | Cx < Dx }. 
Note that the extremality test (Line 10) is applied only on the combinations (ag^g 1 © (bg t )g :) , 
and not on the elements g G G- which satisfies ag < bg. Indeed, the latter belong to C, and by 
induction hypothesis, they are extreme in the cone V, which is included in C. Therefore, they 
can be shown to be also extreme in C. Furthermore, before being appended to the set H, every 
combination ft = (agi)g l © (bg l )gi is first normalized to the element nh : so that the set H 
contains only one element of each represented ray. 

The efficiency of the extremality test used at Line 10 plays a crucial role in the whole 
performance of ComputeExtreme. The existing extremality criteria, which are implemented 
in the Maxplus toolbox of Scilab [23] and in our previous work [25] , are based on the following 
property: ft is extreme in the cone generated by a given set H if and only if ft can not be 
expressed as the tropical linear combination of the elements of H which are not proportional to 
it. This property can be checked in 0{d x \H\) time using residuation (see [28] for algorithmic 
details). If this criterion is used in our algorithm, the amortized complexity of the extremality 
test (over the sequence of tests performed in the loop from Lines 8 to 13) is 0(dx \ G\ 2 ), where G is 
the set of the extreme rays of the intermediary cone T>. Unfortunately, as confirmed theoretically 
and by our experiments, the set G can be very large, even if the cone C has few extreme rays (see 

1: procedure ComputeExtreme (A, B,n) t> A, B e H$m5x 
2: if n = then > Base case 
3: return (ei)l<i<d 

4: else > Inductive case 

5: split Ax < Bx into Cx < Dx and ax < bx, with C,D £ Imax''*"' and a, 6 £ IR^ax 

6: G := ComputeExtreme(C, D, n — 1) 

7: G< :={g'eG\ ag' < bg' }, G> := {gi 6 G | agi > bgJ }, H := G< 

8: for all g' £ G- and gi e G> do 

9: h := (agi)g i {bg')gi 

10: if h is extreme in { x \ Ax < Bx } then 

11: append nh to H , where k is the opposite of the first non-0 coefficient of h 

12: end 

13: done 

14: end 

15: return H 

16: end 

Fig. 3. Our main algorithm computing the extreme rays of tropical cones 




Fig. 4. Extremality of g 2 Fig. 5. The set g 2 + T(g 2 ,C) Fig. 6. The { 0, 1 }-elements of T(g 2 ,C) 

Sect. 6 for benchmarks and a discussion on the size of G). As a consequence, this extremality 
test is the bottleneck of the algorithm, and the latter may not terminate in a practical time 
even for small values of d or n (see Sect. 6). This motivated us to develop a novel approach to 
efficiently characterize the extremality of a vector in a cone defined by inequality constraints. 

4 Characterizing Extremality from Inequality Constraints 

Preliminaries on Extremality. The following lemma, which is a variation on the proof of Th. 3.1 
of [27] and on Th. 14 of [28], shows that extremality can be expressed as a minimality property: 

Proposition 1. Given a 'polyhedral cone C C M^, g is extreme if and only if there exists 
1 < t < d such that g is a minimal element of the set { x G C \ x t = g t }, i.e. g £ C and for 
each x G C, ( x < g and Xt = g t ) implies x = g. In that case, g is said to be extreme of type t. 

In Fig. 4, the light gray area represents the set of the elements (x±,X2, £3) of K max such that 
(x±,X2,xs) < g 2 implies x\ < g\. It clearly contains the whole cone except g 2 , which shows 
that g 2 is extreme of type 1 . 

A tropical segment is the set of the tropical linear combinations of two points. Using the 
fact that a tropical segment joining two points of a polyhedral cone C yields a continuous path 
included in C, one can check that g is extreme of type t in C if and only if there is a neighborhood 
N of g such that g is minimal in {x £ C Pi N \ x± = g t }- Thus, extremality is a local property. 

Finally, the extremality of an element g in a cone C can be equivalently established by 
considering the vector formed by its non-0 coordinates. Formally, let supp(a:) := {i \ X{ 7^ 0} 
for any x G K max - One can show that g is extreme in C if and only if it is extreme in {x G C | 
supp(«) C supp(g)}. This allows to assume that supp(gi) = {1, . . . , d} without loss of generality. 

Expressing Extremality Using the Tangent Cone. For now, the polyhedral cone C is supposed 
to be defined by a system Ax < Bx of n inequalities. 

Consider an element g of the cone C, which we assume, from the previous discussion, to 
satisfy supp(g) = {l,...,d}. In this context, the tangent cone of C at g is defined as the 
tropical polyhedral cone T(g,C) of M max given by the system of inequalities 

max Xi < max Xj for all k such that A^g = B^g, (1) 

ieargmax(yl fc g) jearg max(B fc g) 

where for each row vector c G ij^, argmax(cgr) is defined as the argument of the maximum 
eg = max 1 <j< ( i(cj + grj, and where A^ and B^ denote the A;th rows of A and B, respectively. 
The tangent cone T(g, C) provides a local description of the cone C around g: 

Proposition 2. There exists a neighborhood N of g such that for all x G N, x belongs to C if 
and only if it is an element of g + T(g,C). 

As an illustration, Fig. 5 depicts the set g 2 + T(g 2 ,C) (in semi-transparent light gray) when 
C is the cone given in Fig. 1. Both clearly coincide in the neighborhood of g 2 . Since extremality 
is a local property, it can be equivalently characterized in terms of the tangent cone. Let 1 be 
the element of M^ ax whose all coordinates are equal to 1. 



Proposition 3. The element g is extreme inC if and only if the vector 1 is extreme inT(g,C). 

The problem is now reduced to the characterization of the extremality of the vector lina{0,l}- 
cone, i.e. defined by a system of the form Cx < Dx where C,D € {0,l} raxd . The following 
proposition states that only {0,1 }-vectors (i.e. elements of { 0, 1 } d ) have to be considered: 

Proposition 4. Let V C Mf nax be a cone defined by a system Cx < Dx with C, D £ {0,1 } nxd . 
Then 1 is extreme of type t iff it is the unique element x of V n { 0, 1 } d satisfying x t = 1. 

The following criterion of extremality is a direct consequence of Prop. 3 and 4: 

Theorem 2. Let C C M^ ax be a polyhedral cone. The element g G C is extreme of type t if and 
only ifl is the unique { 0, 1 ^-element of the tangent cone T(g,C) whose t-th coordinate is 1. 

Figure 6 shows that in our running example, the {0,1 }-elements of T(g 2 ,C) distinct from 1 (in 
squares) all satisfy x\ = 0. Naturally, testing, by exploration, whether the set of 2 d ~ 1 {0,1}- 
elements x verifying xt = 1 belonging to T(g, C) consists only of 1 does not have an acceptable 
complexity. Instead, we propose to use the equivalent formulation of the criterion of Th. 2: 

V/ £ { 1, . . . , eZ }, [Vx£T(g,C)n{®,l} d , x, = => x t = 0]. (2) 

Characterizing Extremality with Directed Hypergraphs. A directed hypergraph is a couple (N, E) 
such that each element of E is of the form (T, H) with T, H C N . The elements of N and E 
are respectively called nodes and hyperedges. Given a hyperedge e = (T, i7) £ -E, the sets T and 
i/ represent the tail and the head of e respectively, and are also denoted by T(e) and H(e). 
Figure 8 depicts an example of hypergraph whose nodes are u, v, w, x, y, t, and of hyperedges 
e i = (M, W), e 2 = (M, {«>}), e 3 = ({w},{u}), e 4 = ({v,u>},{x,y}), and e 5 = ({w,y},{t}). 

Reachability is extended from digraphs to directed hypergraphs by the following recursive 
definition: given u,v £ N, then v is reachable from u in 7i, which is denoted u v, if u = v, 
or there exists e 6 E such that v G i?(e) and all the elements of T(e) are reachable from u. This 
induces a notion of hyperpaths, defined as a sequence of hyperedges of E, which can be empty 
if u = v, or of the form e±, . . . ,e p with T(ej) C {u} U H{e{) U • • • U H(ei-±) for all 1 < i < p, 
and v £ H(e p ). In our example, t is reachable from u through the hyperpath e±, e2, e±, e$. 

The size of a hypergraph H = (N, E), denoted by s\ze(H), is defined as \N\+^2 eeE (\T(e)\ + 
\H(e)\). In the rest of the paper, directed hypergraphs will be simply referred to as hypergraphs. 

We associate to the tangent cone T(g,C) the hypergraph 7i(g,C) = (N,E) defined by: 

N = { 1, . . . ,d} E = { (&rgm&y:(B k g),argmax(A k g)) \ A k g = B k g, 1 < k < n] . 

The extremality criterion of Eq. (2) suggests to evaluate, given an element of T(g, C) n { 0, 1 } d , 
the effect of setting its Z-th coordinate to the other coordinates. Suppose that it has been 
discovered that x\ = implies Xj 1 = ••• = Xj n = 0. For any hyperedge e of H(g,C) such 
that T(e) C { l,j±, . . . ,j n }, x satisfies: maxj g #( e ) Xi < maXj Gr( - e ) Xj = 0, so that x\ = for all 
i £ H(e). Thus, the propagation of the value from the Z-th coordinate to other coordinates 
mimicks the inductive definition of the reachability relation from the node Z in 7i.(g,C): 

Proposition 5. For all I £ { 1, . . . ,d}, the statement given between brackets in Eq. (2) holds 
if and only if t is reachable from I in the hypergraph Tt(g,C). 

Hence, the extremality criterion can be restated thanks to some considerations on the strongly 
connected components of Tt(g,C). The strongly connected components (Sees for short) of a 
hypergraph Ti, are the equivalence classes of the equivalence relation =^, defined by u =% v if 
u -^>n v and v u. They form a partition of the set of nodes of 7i. They can be partially 
ordered by the relation defined by C\ -<n Ci if C\ and Ci admit a representative u and v 
respectively such that v u (beware of the order of v and u in v u). Then Prop. 5 and 
Th. 2 imply the following statement: 

Theorem 3. Let C C M^ ax be a polyhedral cone, and g £ C. Then g is extreme if and only 
if the set of the Sees of the hypergraph TL(g,C), partially ordered by ^n(g,C)> admits a least 
element. 



5 Determining the Minimal Sees in Hypergraphs 



Following Th. 3, we describe an algorithm which determines the minimal Sees for the order 
■< in hypergraphs. In particular, it returns the number of such Sees, which is equal to 1 if 
and only if there exists a least Sec. Our approach is best understood by presenting first an 
instrumentation of Tarjan's algorithm [30] which applies only on directed graphs (digraphs), 
and has a linear time complexity The order ^ can be defined on the Sees of digraphs as well. 
Note that in a digraph (or in a hypergraph), a Sec is minimal for this order if and only if it 
has no leaving (hyper)edge (except those entering itself). Therefore, a Sec is a least element if 
and only if it is reachable from any other Sec. Besides, if C\ <g C2 in a digraph G, then C\ is 
greater than or equal to C2 in the topological order. 

The algorithm for digraphs is given in Fig. 7 with the version of MinSccCount calling the 
visiting function GVisit at Line 10 (parts highlighted in gray can be ignored). The array index 
tracks the order in which the nodes are visited: index[u] = i if the node u is the ith one to be 
visited. The value lowfw] is used to determine the minimal index of the nodes which are reachable 
from u in the digraph. A Sec C is discovered as soon as a node u satisfies low[u] = index[u\ 
(Line 27). Then C consists of all the nodes stored in the stack S above u. The node u is the node 
of the Sec which has been visited first, and is called its root. The main difference between our 
algorithm and Tarjan's original one is that the nodes v are provided with a boolean ismin[v] 
allowing to track the minimality of the Sec. A Sec is minimal if and only if its root u satisfies 
ismin[u] = true. In particular, the boolean ismin[v] is set to false as soon as it is connected to a 
node w located in a distinct Sec (Line 21) or satisfying ismin[w] = false (Line 24). The counter 
nb determines the number of minimal Sees which have been discovered (see Line 28). For the 
sake of brevity, we have removed the operations allowing to return the Sees. Instead, when a 
node is discovered to be in a Sec, it is placed in the set Finished (Line 30). 

Nevertheless, this algorithm can not be applied on hypergraphs, since the reachability rela- 
tion in the latter is much more complex. However, determining the minimal Sees in digraphs 
can be helpful. First observe that a digraph G(H) = (N, E') can be associated to any hyper- 
graph H = (N, E), by defining E' = { (t,h) | ({t},H) G E and h € H }. The digraph G(H) is 
generated by the simple hyperedges of 7i, i.e. the elements e 6 E such that |T(e)| = 1. The 
minimal Sees of 7i and G(7i) coincide in the following remarkable special case: 

Proposition 6. Let H be a hypergraph such that each minimal Sec of GiTL) is reduced to a 
singleton. Then H and G(H) have the same minimal Sees. 

If / is a function from N to an arbitrary set, we denote by f(Ti) the hypergraph of nodes 
f(N) and of hyperedges { (/(T(e)), f(H(e))) | e £ E}. The following proposition ensures that, 
in a hypergraph, merging two nodes of a same Sec does not alter the reachability relation: 

Proposition 7. Let 7i = (TV, E) be a hypergraph, and let x,y £ N such that x =n y. Consider 
the function f mapping any node distinct from x and y to itself, and both x and y to a same 
node z ( with z N\ {x,y}). Then u -w^ v if and only if f(u) ~~> f(v) . 

Thus, the minimal Sees of 7i and f(T~C) are in one-to-one correspondence. Using Prop. 6 
and 7, we now sketch a method which computes the minimal Sees in a hypergraph 7i: starting 
from the hypergraph 7i CU r image of TL by the map u ^ { u }, 

(i) compute the minimal Sees of the digraph G(7i cur ). 

(ii) if one of them, say C, is not reduced to a singleton, replace TL C ur by f(Ti. C ur), where / 
merges all the elements U of C into the node UrjeC ^ • Then go back to Step (i). 

(iii) otherwise, return the number of minimal Sees of the digraph G(H C ur)- 

Each time the node merging step (Step (ii)) is executed, new edges may appear in the digraph 
G(7i cur ). This situation is illustrated by Fig. 9. In both sides, the edges of G(TC cur ) are depicted 
in solid, and the non-simple hyperedges ofH. cur in dotted line. The nodes oiTL CUT contain subsets 
of N, but enclosing braces are omitted. Applying Step (i) from node u (left side) discovers a 
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function MlNSccCouNT(iV, F) 

n := 0, nb := 0, S := [], Finished := 
for all e G E do r e := undef , c e := 
for all u 6 N do 

index[u] := undef, low[u] := undef 
F u := [], Makeset(m) 
done 

for all u a N do 

if index[u] = undef then 



> if (N, E) is a digraph 
t> if it is a hypcrgraph 



f GVisit(u) 
\or HVisrr(tt) 
end 
done 
return nb 
end 



function GVisit(m) 

index[u] := n, low[u] := n, n := n + 1 
ismin[u] := true, push u on the stack S 
for all (u, w) 6 E do 

if mdez[iij] = undef then GVisit(u)) 
if w 6 Finished then 

ismin [u] := /a/se 
else 

ioto[u] := min((oty[ii]. 
ismin[u] := ismin [u] 
end 
done 

if io«;[u] = index[u] then 

if ismin[u] = true then nb : 
repeat 

pop v from S, add d to Finished 
until index[v] = index[u] 
end 
end 

function HVisit(u) 

local U := Find(u), local F := [] 
index[U] := n, low[U] := n, n := n + 1 
ismin [[/] := true, push (7 on the stack S 
for all e £ E u do 

if |T(e)| = 1 then push e on F 



, lo«i[«i]) 
; ismin \w] 



nb + 1 
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else 

if r e = undef then r e := u 
local R e := FlND(r e ) 
if R e appears in S then 

C e := C e + 1 

if c e = |T(e)| then 

push e on the stack Fr 
end 
end 
end 
done 

while F is not empty do < 

pop e from F 
for all w £ Fl(e) do 
local W := Find(id) 
if index[W] = undef then HViSIt(ui) 
if W S Finished then 

ismin[U] := false 
else 

low[U] := min(low[U], low[W]) 
ismin[U] := ismin[U] &&: ismin [W] 
end 
done 
done 

if low[U] = index[U] then 
if ismin[U] = true then 
local i := index[U] 

pop each e from Fy and push it on F 

pop V from 5 

while index[V] > i do 

pop each e from Fy and push it on F 

[/ := Merge(C7, V) 

pop V from S 
done 

index[U] := i, push U on S 

if F is not empty then go to Line 51 -j 

nfe := raft + 1 
end 
repeat 

pop V from S, add V to Finished 
until index[V] = index[U] 
end 
end 



auxiliary data up- 
date step 



Step (i) 



> a minimal Sec is discovered 



Step (ii) 



Fig. 7. Computing the minimal Sees in hypergraphs 

minimal Sec formed by u, v, and w in the digraph G(TC cur ). At Step (ii) (right side), the nodes 
are merged, and the hyperedge is transformed into two graph edges leaving the new node. 

The termination of this method is ensured by the fact that the number of nodes in H CU r 
is strictly decreased each time Step (ii) is applied. When the method is terminated, minimal 
Sees of n cur are a,U reduced to single nodes, which contain subsets of N . Propositions 6 and 7 
prove that these subsets are precisely the minimal Sees of H. Besides, the method returns 
the exact number of minimal Sees in H. However, in order for this approach to be efficient, 
the algorithm should avoid computing a same minimal Sec several times. For this reason, we 
propose to directly integrate the node merging step in our first algorithm on digraphs. 

First observe that the nodes of the hypergraph 7i cur always form a partition of the initial 
set ./V of nodes. Instead of referring to them as subsets of TV, we use a union-find structure, 
which consists in three functions Find, Merge, and MakeSet (see e.g. [31, Chap. 21]): 

• Find(m) returns, for each original node u 6 N, the unique node of TL C ur containing u. 

• two nodes U and V of 7i cur can be merged by Merge(?7, V), which returns the new node. 

• the "singleton" nodes { u } of the initial H CU r are created by a function MakeSet. 

With this structure, each node of H C ur lS represented by an element u G N, and then corresponds 
to the subset {v £ N \ Find(w) = u}. Nevertheless, we avoid confusion by denoting the nodes 
of the hypergraph 7i by lower case letters, and the nodes of 7i cur by capital ones. By convention, 
if u £ N, Find(-u) will correspond to the associated capital letter U. Note that when an element 
u G N has never been merged with another one, it satisfies Find(u) = u. 



Fig. 8. A directed hypergraph Fig. 9. A merging node step (the index of the visited nodes is given beside) 

The resulting algorithm on hypergraphs is obtained by using the function HVisit at Line 10. 
We present the main ideas used in the correctness proof of the algorithm, highlighting the 
differences with the first algorithm on digraphs. Albeit TL cur is not explicitly manipulated, it 
can always be inferred as the image of H by the function Find. The visiting function HVlSlT(it) 
computes the minimal Sees reachable from the node Find(u) in the digraph G(H C ur), using 
the same method as in GVisit (see the part corresponding to Step (i), from Lines 51 to 63). 
However, as soon as a minimal Sec is discovered, the node merging step (Step (ii)) is executed. 

Node Merging Step. It is performed from Lines 66 to 75, when the node U = Find(m) is the root 
of a minimal Sec in G(H CU r)- All nodes V which have been discovered in that Sec are merged 
to U (Line 7i). Let H new be the resulting hypergraph. At Line 75, the stack F is supposed to 
contain the new edges of G{7i new ) leaving the newly "big" node U. If it is empty, {U} is a 
minimal Sec of G(7i new ), hence also oiH new (Prop. 6). Thus nb is incremented. Otherwise, we 
go back to the beginning of Step (i) to discover minimal Sees from the new U in G(7i new ). 

Discovering the New Graph Edges. During the execution of HVisit(u), the local stack F is 
used to collect the hyperedges which represent edges leaving the node Find (it) in the digraph 
G(H C ur)- When HVisit(u) is called, Find(u) is initially equal to u. The loop from Lines 38 to 50 
iterates over the set E u , which consists of the hyperedges e G E such that u G T{e). (The E u 
can be built in linear time by traversing the set E before running MinSccCount.) At the end 
of the loop, F is filled with all the simple hyperedges leaving u = Find(u) in H cur , as expected. 

Now the main difficulty is to collect in F the edges which are added to the digraph G{TL C ur) 
after a node merging step, without examining all the non-simple hyperedges. To overcome this 
problem, each non-simple hyperedge e G E is provided with two auxiliary data: a counter c e > 0, 
and a node r e called its root. Invariants defining r e and c e are given in Fig. 10. Observe that at 
the call to HVisit(u), the counter c e of each non-simple hyperedge e G E u is incremented only 
when R e = FiND(r e ) belongs to the stack S (Line 44). This indeed holds if and only if Find(u) 
is reachable from FiND(r e ) in the digraph G(7i cur ). 

According to Fig. 10, when c e = |T(e)|, all the nodes X = Find(x) (for x G T(e)) are 
reachable from R e in G(7i cur ). Now suppose that, later, it is discovered that R e belongs to a 
minimal Sec of G(7i cur ). Then all the X stand in the same Sec. (Indeed, if C is a minimal 
Sec and t G C, z is reachable from t if and only if z G C.) Hence, when the node merging step 
is applied on this Sec, the X are merged into a single node U. In that case, the hyperedge e 
generates new simple edges leaving U in the new version of the digraph G(7i cur ). Now let us 
verify that e is correctly placed into F by our algorithm: as soon as c e reaches the threshold 
|T(e)|, e is placed into a temporary stack Fr e associated to the node R e (Line 46). It is then 
emptied into F at Lines 67 or 70 during the node merging step. 

For example, in the left side of Fig. 9, the execution of the loop from Lines 38 to 50 during 
the call to HViSiT(f) sets the root of the hyperedge e^ to the node v, and c e4 to 1. Then, during 
HVisit(u>), c e4 is incremented to 2 = |T(e4)|. The hyperedge is therefore pushed on the stack 
F v (because R ei = FiND(r e4 ) = Find(w) = v). Once it is discovered that u, v, and w form a 
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description 




element of N or undef 


first node x of T(e) to be visited by a call to HVisit 


Ce 


integer 


number of elements x € T(e) such that index[x] is defined, and Find(x) is reach- 
able from FlND(r e ) in the current graph G(TL CU r) 


F 


stack local to H Visit (it) 


contain the hyperedges which represent edges leaving the node Find(m) in G{H CU r) 



Fig. 10. Invariants on auxiliary data in HVisit 



minimal Sec of G(H cur)j e 4 is collected into F. It then allows to visit the node x from the new 
node (rightmost hypergraph). A fully detailed execution trace is provided in Appendix E. 
Complexity. Using disjoint-set forests with union by rank and path compression as union-find 
structure (see [31, Chapter 21]), a sequence of p operations MakeSet, Find, or Merge can 
be performed in time 0(p x a(|iV|)), where a is the very slowly growing inverse of the map 
x i — ^ A(x,x), and where A is the Ackermann function. Then the following statement holds: 

Theorem 4. Let Ti = (N, E) be a hypergraph. Then MinSccCount(H) returns the number 
of minimal Sees in 7i in time 0{s\ze(TL) x a(|iV|)). Besides, the minimal Sees are formed by 
the sets { v € N | Find(-u) = U and ismin[U] = true }. 

For any practical value of x, a(x) < 4. That is why the complexity of MinSccCount is said 
to be almost linear in size(H). Finally, our method does not allow to determine all Sees, which 
is apparently a harder problem (see Appendix F for further details). 

6 Resulting Algorithm and Experiments 

Thanks to Th. 3, the equality test MiNSccCouNT(W(h,, C)) = 1 can be used to determine 
the extremality of h at Line io in ComputeExtreme. Since the hypergraph TL{h,C) can be 
computed in time 0(nd), which is also the order of magnitude of its size, the complexity of each 
extremality test is 0{nda{d)). Using the notations of Sect. 3, this represents an improvement 
factor of 0{\G\ 2 /{na{d))) over the previous extremality test presented in Sect. 3. As discussed 
below and confirmed by the experiments, the size of G, which is the number of extreme rays 
in the intermediary cone T>, is much larger than na(d) in general, so that the performance of 
ComputeExtreme is significantly improved. 

In classical geometry, the upper bound theorem of McMullen [32] shows that the maximal 
number of extreme points of a convex polytope in M d defined by n inequality constraints is 
equal to U(n,d) := (™~ L( „ d +J )/2j ) + (" _L( n ^ )/2j ). The cyclic polytopes (see [29]) are known to 
reach this bound. In the tropical setting, a recent work of Allamigeon, Gaubert, and Katz [33] 
proves that the number of extreme rays of a tropical polyhedral cone C in M^ ax defined by n 
inequalities is bounded by similar quantity, U(n+d, d— 1). The latter is of an order of magnitude 
of (n + d) d / 2 . Besides, natural candidates to be the maximizing cones, named tropical signed 
cyclic polyhedral cones, have been studied in [33]. While they asymptotically reach the bound 
U(n + d, d— 1) when d — ► +oo and n is fixed, the bound does not seem to be attained in general. 
However, a lower bound on the number of extreme rays of signed cyclic polyhedral cones have 
been established for various cases: their order of magnitude are 0((n - 2d)2 d ~ 2 ) when n > 2d, 
and 0(d^ d ~ n ~ 1 ^ 2 ) when d > 2n + 1. This confirms our previous statement on the size of G. 

Like Moztkin's double description method [34,35], the performance of ComputeExtreme 
depends on the size of the sets returned in the intermediate steps. However, the complexity of the 
elementary step, i.e. the computation of the elements provided by Th. 1 and the elimination of 
non-extreme ones (Lines 7 to 13), can be precisely characterized: 0(nda(d) |G| 2 ). In comparison, 
the same step in the double description method relies on a combinatorial characterization of the 
adjacency of rays, of complexity 0(n |G| 3 ) [35]. As in general, the size of G is much larger than 
da(d), the elementary step in our algorithm has a better complexity than its classical analogue. 

A common point between our algorithm and Motzkin's approach is that the result provided 
by our algorithm does not depend on the order of the inequalities in the initial system. This 
order may impact the size of the intermediary sets and subsequently the execution time. In 
our experiments, inequalities are dynamically ordered during the execution: at each step of the 
induction, the inequality ax < bx is chosen so as to minimize the number of combinations 
{ag ] )g l © {bg l )g 3 . Note that this strategy does not guarantee that the size of the intermediate 
sets of extreme elements is smaller. However, it reports better results than without ordering. 

Benchmarks. The algorithm ComputeExtreme has been implemented in a prototype written 
in OCaml. Table 1 reports some experiments for different classes of tropical cones: samples 



Table 1. Execution time benchmarks on a single core of a 3 GHz Intel Xeon with 3 Gb RAM 





d 


n 


# final 


# inter. 


T(s) 


T> (s) 


T/T' 


rndlOO 


12 


15 


32 


59 


0.24 


6.72 


0.035 




rndlOO 


15 


10 


555 


292 


2.87 


321.78 


8.9 • 10" 


3 


rndlOO 


15 


18 


152 


211 


6.26 


899.21 


7.0- 10" 


3 


rnd30 


17 


10 


1484 


627 


15.2 


4667.9 


3.3 • 10" 


3 


rndlO 


20 


8 


5153 


1273 


49.8 


50941.9 


9.7- 10" 


4 


rndlO 


25 


5 


3999 


808 


9.9 


12177.0 


8.1 • 10" 


4 


rndlO 


25 


10 


32699 


6670 


3015.7 








cyclic 


10 


20 


3296 


887 


25.8 


4957.1 


5.2 ■ 10- 


-3 


cyclic 


15 


7 


2640 


740 


8.1 


1672.2 


5.2 • 10" 


-3 





d 


n 


# final 


# inter. 


T(s) 


T' (s) 


T/T' 


cyclic 


17 


8 


4895 


1589 


44.8 


25861.1 


1.7- 10" a 


cyclic 


20 


8 


28028 


5101 


690 


> 10 days* 


< 8.0- 10~ 4 


cyclic 


25 


5 


25025 


1983 


62.6 


> 8 days* 


< 9.1 • 10~ 5 


cyclic 


30 


5 


61880 


3804 


261 






cyclic 


35 


5 


155040 


7695 


1232.6 









# var 


# lines 


T(s) 


T> (s) 


T/T' 


oddcvcn8 
oddcven9 
oddevcnlO 


17 
19 
21 


118 
214 
240 


7.6 
128.0 
1049.0 


152.1 
22101.2 


0.050 
5.8 • 10~ 3 



formed by several cones chosen randomly (referred to as rndx where x is the size of the sample), 
and signed cyclic cones which are known to have a very large number of extreme elements. For 
each, the first columns respectively report the dimension d, the number of constraints n, the 
size of the final set of extreme rays, the mean size of the intermediary sets, and the execution 
time T (for samples of "random" cones, we give average results). We compare our algorithm 
with a variant using the extremality criterion which is discussed in Sect. 3 and used in the other 
existing implementations [23,25]. Its execution time T' is given in the seventh column. The 
ratio T/T' shows that our algorithm brings a huge breakthrough in terms of execution time. 
When the number of extreme rays is of order of 10 4 , the second algorithm needs several days 
to terminate (the asterisk means that its execution was not terminated at the moment of the 
submission). Therefore, the comparison could not be made in practice for some cases. 

Table 1 also reports some benchmarks from applications to static analysis. The experiments 
oddeveni correspond to the static analysis of the odd-even sorting algorithm of i elements. It 
is a sort of worst case for our analysis. The number of variables and lines in each program is 
given in the first columns. The analyzer automatically shows that the sorting program returns 
an array in which the last (resp. first) element is the maximum (minimum) of the array given as 
input. It clearly benefits from the improvements of ComputeExtreme, as shown by the ratio 
with the execution time T' of the previous implementation of the static analyzer [25] . 



7 Conclusion and Related Work 

This work provides an efficient method to compute all extreme elements of tropical polyhedra 
and polyhedral cones. It is based on a successive elimination of inequalities, and a criterion to 
characterize extremality of elements by determining the existence of a Sec reachable from any 
other in a directed hypergraph, in almost linear time of its size. Existing approaches [20-22, 25] 
are also based on a successive inequality elimination technique, however their elementary step 
differs from the result of Th. 1, so that in practice, possibly more extremality tests are performed 
in their existing implementations [23, 25]. Besides, these tests rely on the method described in 
Sect. 3. As discussed in Sect. 6, they are consequently less efficient than our algorithm. 

The algorithm MinSccCount is also a noteworthy contribution, since as far as we know, 
there is no method in the existing literature to determine minimal Sees for the order ^, in linear 
(or almost) time in directed hypergraphs. Alternatively, we could have used Gallo's et al. [36] 
linear algorithm to compute the set of reachable nodes from each node and obtain the whole 
reachability graph, but the total complexity would have been 0(|-/V| size(H)), i.e. a slowdown 
of almost a factor \N\. 

We hope that our method can be considered as a tropical analogue of the very popular double 
description method of Motzkin, which is able to handle various kinds of convex polyhedra. Avis 
and Fukuda have developed a method which is polynomial for a subclass of non- degenerate 
polyhedra [37, 38]. As a future work, it would be very interesting to adapt it for a class of tropical 
polyhedra. This is a longer term objective, as tropical geometry is a young theory, and several 
difficulties still have to be overcomed: for instance, the notion of faces is not yet well defined 
in the tropical setting (see [17] for a discussion). However, we believe that our characterization 
of extreme points in terms of directed hypergraphs is a first step towards understanding the 
combinatorics of tropical polyhedra, and will be helpful to develop further algorithms. 
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A Proof of Th. 1 



Let G' be the family of elements provided by Th. 1. The inclusion cone(G') C C HTi is obvious. 

Now consider x € C n H. Suppose that G = { g 1 , . . . , g p }. Let Ai, . . . , X p € Kf nax such that 
x = 0f =1 Ajgi 1 . Observe that ax < bx implies: 

Hag 1 ) © A,W)< Hbg 1 ) © A,(V). (3) 

ag^<bg l agi>bgi a^ i <bg i ag3>bgi 

Suppose that og 3>5 g j Xj(bgi) > a9 i<5 g i Xi(bg l ). Then there must beafe such that Xk(bg k ) 
is equal to ag j > 5 9 j Xj{bgi), and necessarily A& > 0. But (3) leads to Xk(bg k ) > Xk(ag k ) while 
ag fc > which is a contradiction. It follows that agJ> 5 g j Xj(bgi) < ag ;<5 g i Xi(bg l ), so 
that, by (3), 

A,W)< W)- 

agj>bgi ag i <bg i 

Let /t be the right member of (4). If k > 0, then 



(4) 



x= W 

ag ! < bg 1 

= w 

ag ! < bg 1 



W 

agJ <6g3 

f A ^W) 
agiKbgi L ag3>bg] 



© « _1 © W) 

agi>bgi ag l <bg l 

W © k- 1 ^[(o^Ve^V] 

ag i <bg i ag i <bg i 



ag 3 >bg 3 



which shows x € cone(G'). Otherwise, n = 0. By (4), Xj{agi) = for each j such that 
ag J > 6gr J , hence Xj = 0. It follows that x = a9 i<5 g ; Xig 1 , thus # G cone(G'). □ 



B Proofs of the Statements in Sect. 4 



This section also includes additional material which was only sketched in Sect. 4 for sake of 
brevity. 

Proof (Prop. 1 ). If there exists 1 < t < d such that g is minimal in { x € C \ x± = g t }, then let 
x 1 , x 2 e C such that g = x 1 © x 2 . In that case, for each i E {1,2}, x l < g, and there is an i 
such that x\ = g t , so that x l = gr. 

Conversely, assume that for every index t, g is not minimal in the set { x £ C \ Xt = g t }, so 
that we can find a vector x l such that x l < g, x\ = <7 t , and x l / g\ As a result, g = a: 1 ©. . .®x d . 
Since no x l is equal to g, this shows that g cannot be extreme. □ 

The following proposition shows that extremality can be expressed as a local property. 

Proposition 8. Given a polyhedral cone C C K„ iax , g is extreme of type t if and only if there 
exists a neighborhood V of g such that g is a minimal element of the set {x € C (~)V \ x± = g t } . 

Proof. If such a neighborhood V exists, let us consider x £ C such that x < g and xt = g t . 
Suppose that x is distinct from g. Then any element of the form y = x © ag with a < also 
satisfies y < g, y t = g t , and y / g. Now, for a enough close to g, y belongs to V, which 
contradicts the extremality of g. □ 



Lemma 1. If g ^2 to is extreme of type t in a cone C C M^^, then g t ^to. 

Proof. Suppose that g t = to, and let g' = (— l)g. Then g' G C, g' < g, and g' / g since g / 0. 
This is a contradiction with the extremality of type t of g. □ 

Given a subset / of { 1 , . . . , d }, let 7r/ be the function mapping any x G M^ ax onto the 
element consisting of its coordinates for each i <E I. Then the following proposition holds: 

Proposition 9. Let C C Mf nax 6e a polyhedral cone, and g ^ to an element of C such that 
supp(S'). Then the three following statements are equivalent: 

(i) g is extreme in C, 

(ii) g is extreme in { x G C | supp(a?) C S}, 

(Hi) iTs(g) is extreme in tts({x G C | supp(a;) C S}). 

Proof. For sake of simplicity, we assume that S = {1, . . . ,p}, where p is the cardinality of S. 
Let V := { x G C | supp(x) C S }, £ := 7rs(£>) and h := irsid)- 

(i) =^ (ii) Supposing g extreme in C, g is obviously extreme in V since gGP and V C C. 

(ii) (hi) Now, suppose g extreme of type t in the cone T>. By Lemma 1, t belongs to S. 
Consider y G £ such that y < h and y t = h t . Let x G V such that y = 7rs(a:). Then x < g 
and x t = y t = h t = g t . As a consequence, x = g, which implies y = h. 

(iii) =^ (i) Finally, if h is extreme of type t in £, let x G C such that x < g and = gr t . Then 
supp(:r) C S, hence y = tts(x) satisfies y G £, y < h and y t = h t . This shows y = h, so 
that x = g. □ 

This allows to suppose in Sect. 4 that supp(g) = { 1, . . . , d} without loss of generality. 

Proof (Prop. 2). Consider a neighborhood iV in which all elements x satisfy the following 
conditions: 

• A^x < B^x for all k such that A^g < Bj,g, 

• arg max(Akx) C arg max(Akg) and argmax(i?fcic) C arg max(Bkg) for any other k. 
Let x G N. Note that x belongs to C if and only if, for each k verifying Aj,g = B^g, 

max (ciki + Xi) < max (b^j+Xj), (5) 

iSarg max(A fe g) j'Garg ma,x(B k g) 

by definition of N. 

Suppose that x belongs to C. Let k such that A^g = .B^gr > 0. Since for all i G arg max(Akg) 
and j G argmax(i?fcgr), ati+gi = bkj+gj > 0, the term aki+gi (resp. bkj+gj) can be substracted 
from cifcj + «j (resp. + a?j) in (5), which shows: 

max \xi- gi )< max (xj - gj. (6) 

ieargmax(.4 fc g) jGarg max(_B fe g) 

Now, if A^g = B^g = 0, then arg max(^4fegi) = arg max (B^g) = {1, . . . ,d}, hence (6) is a 
tautology. 

Conversely, suppose that x — g is an element of T(g,C). Consider k such that A^g = B^g. 
Adding the term aki + gi (resp. bkj + gj) to each Xi — g { (resp. Xj — g^) in (6) shows that x 
satisfies (5). □ 

Proof (Prop. 3). Let N' be the set consisting of the elements x — g for x G N, where iV is given 
by Prop. 2. First remark that g G C if and only if 1 G T(g, CC). 

Suppose that g is extreme of type t in the cone C, and consider y G T(g,C) n AT' such that 
y < 1 and y 4 = 1. Let x = g + y, then a; G C by Prop. 2. Besides, x < g and x t = g t , so 



that x = g and y = 1. As N' is a neighborhood of 1, this proves that 1 is extreme of type t in 
T(g,C) by Prop. 8. 

Conversely, suppose that 1 is extreme of type t in T(g,C). Let x G C n N verifying x < g 
and Xt = g t . li y = x — g, then by Prop. 2 that y is an element of T(g,C). Moreover, y < 1 
and y t = 1, so that y = 1 and x = g. □ 

Proof (Prop. 4)- If 1 is extreme of type t, then it is minimal in the set { x G V \ x t = 1 }, so 
that it is also minimal in {x G X> n { 0, 1 } d | x t = 1 }. It follows that the latter set is reduced 
to {1}. 

Conversely, consider x G T> such that x < 1 and Xt = 1. Let (x k )k>i be the sequence 
defined by x k := k x Xj. Since the coefficients of the matrices C and D are in the set {0,1}, 
it is straightforward that each x k belongs to T>. Moreover, the sequence (x fc )&>i obviously 
converges to an element x°° G {0,l} d , such that x°° < x k for all k > 1. It follows that 
x°° < 1, and a;^ = 1. Besides, the cone P is closed, thus x°° G P. This implies x°° = 1 by 
assumption, which leads to x = 1. □ 

Proof (Th. 2). It is a direct consequence of Prop. 3, and of the application of Prop. 4 on the 
cone T(g, C), which is indeed a { 0, 1 }-cone. □ 

Proposition 10. Suppose that g G C. Then g is extreme of type t if and only if the following 
statement holds for all I G { 1, . . . , d }: 

Vx G T(g,C) n{0,l} d , xi = => x t = 0. (7) 

Proof. Suppose that g is extreme of type t. By Th. 2, 1 is the unique element of T(g, C) n{ 0, 1 } d 
whose t-th coordinate is 1. Let I G and a; G T(g, C) n { 0, 1 } d such that x l = 0. 

Then a; ^ 1, so that a; t = 0. 

Conversely, since g GC, then 1 G T(g,C) by Prop. 2. Besides, consider a: G T(g, C) (~){ 0, 1 } d 
such that a;t = 1. Then for all / G { 1, . . . ,d}, x\ = 1 by Eq. (7), so that x = 1. This proves 
that 1 is the unique element of T(g, C) n { 0, 1 } d whose t-th coordinate is 1, which implies that 
g is extreme of type t by Th. 2. □ 

Proof (Prop. 5). Suppose that t is reachable from / in Tt(g,C). Suppose that x G T(g,C) n 
{0,l} d such that x\ = 0. Let us show by induction on the definition of the reachability in 
H(g,C) that x t = 0: 

— if t = I, then obviously Xt = 0. 

— otherwise, there exists a hyperedge e such that t G i?(e) and for all j G T(e), j is 
reachable from I. By induction hypothesis, a?j = 0. Let A^y < B^y be the inequality 
associated to e. Then arg m&yJyA^g) = H(e) and arg max(Sjtgi) = T(e). Since a; satis- 
fies max i6argmax(Afeg) xt < max jeargmax(Bfeg) as^-, we have max ieH(e) x t < max jeT(e) Xj = 0. 
Hence x t = 0, so that Eq. (2) holds. 

Conversely, suppose t is not reachable from I. Let us define a; = x\ by £Cj = if z is 
reachable from I, and 1 otherwise. Let 1 < k < n such that A^g = B^g. Let e be the associated 
hyperedge in H(g,C). Remember that arg max(Akg) = H(e) and arg max(Bkg) = T(e). If for 
all j G T(e), j is reachable from I, then all elements i G H(e) are also reachable from I, so 
that max i6argmax(Afcg ) x» = max jear g max(jBfeg ) Xj = 0. Otherwise, there exists j G T(e) which is 
not reachable from I, so that maXj gargmax ( Bfcg ) Xj = 1. Since maxj eargmax (^ feg ) Xj is always less 
than or equal to 1, the inequality maxjgaj-gjnax^g) X{ < maXj gargmax ( Bfeg ) holds. It follows that 
T(g, C) n { 0, 1 } d admits an element x such that x\ = and as t = 1. □ 

Proof (Th. 3). From Prop. 5 and 10, g is extreme of type t if and only if t is reachable from 
any node / of the hypergraph H(g,C). This holds if and only if t belongs to a Sec C such that 
C dm(g,C D for any Sec D. As a consequence, g is extreme if and only if a least Sec exists. □ 



C Proofs of Prop. 6 and 7 



The reachability relation in a digraph G is denoted by ~~>g- We first introduce a lemma: 

Lemma 2. Let TL be a directed hypergraph. Each Sec C of Tt is of the form UjC^ where the 
C[ are the Sees o/ G(ft) such that CnC^fl. 

Proof. Formally, consider u G G. Then there exists a Sec C" of G(ft) such that it G G (since 
the Sees of G(ft) form a partition of the set A/"), and obviously C f~l C ^ 0. 

Conversely suppose that C is a Sec of G(ft) such that G n C" / 0. Let it G G n C". Then 
for any i; G C, it ^>G(n) v ^G{n) u > so that u -^>n v ~~>-ft u, hence v G C. □ 

Proof (Prop. 6). First suppose that {u} is a minimal Sec of G(H). Suppose that there exists 
d/m such that u v. Consider a hyperpath ei,...,e p from u to t; in ft. Then there must be 
a hyperedge ej such that T(ej) = { it } and H(ei) / { it } (otherwise, the hyperpath is a cycle 
and i> = it). Let if G i/(ej) \ { it }. Then (it, w) is an edge of G(H). Since { u } is a minimal Sec 
of G(ft), this enforces w = u, which is a contradiction. Hence { it } is a minimal Sec of ft. 

Conversely, consider a minimal Sec G of ft. Let it G G, and let D be the Sec of G(TC) 
containing it. Consider D' a minimal Sec of G(H) such that D' ^g{H) ^> an d C" be a Sec 
of ft such that D' n G' ^ 0. By Lemma 2, we have D' C G'. It follows that C' < n G, hence 
G = G' by minimality of G. Thus, D' C G, and since D' is a singleton, it also forms a Sec of 
ft using the first part of the proof. This shows D' = C (since the Sees of ft form a partition of 
the set of nodes), so that G is a minimal Sec of G(ft). □ 

Proof (Prop. 7). Let ft' = /(ft). Suppose that s t. Observe that if X, Y are subsets of N, 
f(X) C fiX) as soon asIcF, and f(X UY) C f(X) U f(Y). Therefore, if e u . . . , e p is a 
hyperpath from s to t, then: 

T(ei) C { s } U fT(ei) U • • • U Hfe-i) for all 1 < i < p 

t G H(e p ) 

so that: 

/(T(ei)) C { f(s) } U /Off(ei)) U • • • U f(H( ei ^)) for all 1 < i < p 

/(t) G /(fr(ep)) 

It follows that /(s) "~*f(w) f(t)- 

Conversely, suppose that f{t) is reachable from f(s) in ft', and that f(t) / /(s) (the case 
f(t) = f( s ) is trivial). Let H = { s } and T p+1 = { t }. 

By definition, there exist e\ = (Ti, fti), . . . , e p = (T p ,H p ) in E such that for each i G 
{l,...,p+l},f(T i )cf(H )U---Uf(H i _ 1 ). 

Also note that for any s G p(N), f(s) = sinsn{x,t/}=0 and /(s) = sU{xy}\{x,y} 
otherwise. In particular, as soon as xy f(s), f(s) coincides with s. Besides, f(s) \{xy} C 
s C f(s) \ {xy} U {x,y}. 

Two cases can be distinguished: 

(a) suppose that xy does not belong to any f(Hj), so that f(Hj) = Hj. Similarly, for each 
i > 1, f(Ti) does not contain xy, hence /(Tj) = Tj. Besides, Tj C -ffo U ■ ■ ■ U -fti-i for each 
i, so that is is straightforward that f(s) /(t). 

(b) now, if xy in one of the f(Hj), let A: be the smallest integer such that xy G f(Hk). Say for 
instance that x G -ftfe- Let (T[,H{), . . . , {T' q , H' q ) be taken from a hyperpath from x to y in 
ft. 

When i < k, /(Tj) does not contain xy, hence /(Tj) = Tj and Tj C f(H ) U • • • U /(^-i) = 
fl" U---Ufli_i. 
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Fig. 11. First intermediary form of our almost linear algorithm on hypergraphs 



Besides, T{ = { x } C H U • • • U H k , and for each i G { 2, . . . , q }, T[ C H U • • • U H k U U 
• • • U since x G H k . 

Finally, let us prove for i > k + 1 that Tj C -ff U ■ ■ ■ U H k U H[ U ■ ■ ■ U H' q U H k+1 U . . . 

Clearly, /(T^) \ {xy} C U}=o(/(^i) \ { x u})- Besides, x £ H k and y £ H' q , and since Tj is 
included into f(Ti) \ { xy } U {x, y}, then T« is also contained in Ho U • • • U H k U H[ U • • • U 

ff;u%iu-uffi-i. 

It follows that (Tj, Hi) i=h _ jk , (T{, -H"-)j=i,..., ? , i?i)i=fc+i,..., p forms a hyperpath from s to 
t in □ 

D Proof of Th. 4 

Theorem 4 contains two statements, a first one relative to the correctness of MinSccCount 
(i.e. it precisely computes the minimal Sees), and a second one to its time complexity. We first 
focus on the first part. 

D.l Correctness of the Algorithm 

The correctness proof of the algorithm MinSccCount turns out to be harder than the one of the 
classical Tarjan's algorithm, due to the complexity of the invariants which arise in the former 
algorithm. That is why we propose to show the correctness of two intermediary algorithms, 
named MinSccCount2 (Fig. 11) and MinSccCount3 (Fig. 12), and then to prove that they 
are equivalent to MinSccCount. 

The main difference between the first intermediary form and MinSccCount is that it does 
not use auxiliary data associated to the hyperedges to determine which ones are added to the 
digraph G(7i cur ) after a node merging step. Instead, the stack F is directly filled with the right 



hyperedges (Lines 18 and 43). Besides, a boolean no -merge is used to determine whether a node 
merging step has been executed. The notion of node merging step is refined: it now refers to the 
execution of the instructions between Lines 35 and 44 in which the boolean nojmerge is set to 
false. 

For sake of simplicity, we will suppose that sequences of assignment or stack manipulations 
are executed atomically. For instance, the sequences of instructions located in the blocks from 
Lines 14 and 19, or from Lines 35 and 44, and at from Lines 52 to 54, are considered as elemen- 
tary instructions. Under this assumption, intermediate complex invariants do not have to be 
considered. 

We first begin with very simple invariants: 

Invariant 1. Let U be a node of the current hypergraph TC C ur- Then index[U] is defined if and 
only if index[u] is defined for all u £ N such that Find(u) = U. 

Proof. It can be shown by induction on the number of node merging steps which has been 
performed on U. 

In the basis case, there is a unique element u G N such that Find(u) = U. Besides, U = u, 
so that the statement is trivial. 

After a node merging step yielding the node U, we necessarily have index[U] 7^ undef. 
Moreover, all the nodes V which has been merged into U satisfied index [V] 7^ undef because 
they were stored in the stack S. Applying the induction hypothesis terminates the proof. □ 

Invariant 2. Let u G N. When index[u] is defined, then Find(k) belongs either to the stack S, 
or to the set Finished (both cases can not happen simultaneously) . 

Proof. Initially, Find(u) = u, and once index[u] is defined, Find(u) is pushed on S (Line 16). 
Naturally, u Finished, because otherwise, index[u] would have been defined before (see the 
condition Line 54). After that, U = Find(u) can be popped from S at three possible locations: 

— at Lines 36 or 40, in which case U is transformed into a node U' which is immediately pushed 
on the stack S at Line 42. Since after that, Find(u) = U' , the property Find(u) G S still 
holds. 

- at Line 53, in which case it is directly appended to the set Finished. □ 
Invariant 3. The set Finished is always growing. 

Proof. Once an element is added to Finished, it is never removed from it nor merged into 
another node (the function Merge is always called on elements immediately popped from the 
stack S). □ 

Proposition 11. The function MinSccCount2('H) returns the number of minimal Sec ofH. 
Besides, the minimal Sees are formed by the sets { v £ N | Find(v) = U and ismin[U] = true }. 

Proof. We prove the whole statement by induction on the number of node merging steps. 

Basis Case. First, suppose that the hypergraph H is such that no nodes are merged during 
the execution of MinSccCount2('H), i.e. the node merging loop (from Lines 37 to 41) is never 
executed. Then the boolean no.merge is always set to true, so that n is never redefined to 
i + 1 (Line 46), and there is no back edge to Line 20 in the control-flow graph. It follows that 
removing all the lines between Lines 35 to 47 does not change the behavior of the algorithm. 
Besides, since the function Merge is never called, Find(u) always coincides with u. Finally, 
at Line 18, F is precisely assigned to the set of simple hyperedges leaving u in 7i, so that 
the loop from Lines 20 to 32 iterates on the successors of u in G(H). As a consequence, the 
algorithm MinSccCount2(H) behaves exactly like MinSccCount(G(W)). Moreover, under 
our assumption, the minimal Sees of G(Ti.) are all reduced to singletons (otherwise, the loop 
from Lines 37 to 41 would be executed, and some nodes would be merged). Therefore, by Prop. 6, 
the statement in Prop. 11 holds. 



Inductive Case. Now suppose that the node merging loop is executed at least once, and that 
its first execution happens during the execution of, say, HVisit2(x). Consider the state of 
the algorithm at Line 35 just before the execution of the first occurrence of the node merging 
step. Until that point, Find(-u) is still equal to v for all node v £ N, so that the execution of 
MinSccCount(W) coincides with the execution of MinSccCount(G(W)). Consequently, if C 
is the set formed by the nodes y located above x in the stack S (including x), C forms a minimal 
Sec of GiTL). In particular, the elements of C are located in a same Sec of the hypergraph TL. 

Consider the hypergraph TL' obtained by merging the elements of C in the hypergraph 
(N,E \ { e | 3y £ C s.t. T(e) = {y}}), and let X be the resulting node. For now, we may 
add a hypergraph as last argument of the functions HVisit2, Find, ... to distinguish their 
execution in the context of the call to MinSccCount2(H) or MinSccCount2(W'). We make 
the following observations: 

— the node x is the first element of C to be visited during the execution of MinSccCount2(H). 
It follows that the execution of MinSccCount2(H) until the call to HVisit2(x,H) coin- 
cides with the execution of MinSccCount2(W) until the call to HVisit2(X, TL'). 

— besides, during the execution of HVisit2(x, TL), the execution of the loop from Lines 20 to 32 
only has a local impact, i.e. on the ismin[y], index[y], or low[y] for y £ C, and not on nb 
or any information relative to other nodes. Indeed, we claim that the set of the nodes y on 
which HVisit2 is called during the execution of the loop is exactly C\ {x}. First, for all 
y £ C \ {x}, HViSiT2(y) has necessarily been executed after Line 20 (otherwise, by Inv. 2, 
y would be either below x in the stack S, or in Finished). Conversely, suppose that after 
Line 20, there is a call to HVlSlT2(t) with t C. By Inv. 2, t belongs to Finished, so that 
for one of the nodes w examined in the loop, either w £ Finished or ismin[w] = false after 
the call to HVisit2(u>). Hence ismin[x] should be false, which contradicts our assumptions. 

— finally, from the execution of Line 47 during the call to HVisit2(x, TL), our algorithm behaves 
exactly as MinSccCount2(H') from the execution of Line 20 in HVisit2(X, TL'). Indeed, 
index[X] is equal to i, and the latter is equal to n — 1. Similarly, for all y £ C, low[y] = i 
and ismin[y] = true. The node X being equal to one of the y £ C, we also have ZowLY] = % 
and ismin[X] = true. Moreover, X is the top element of S. 

Furthermore, it can be verified that at Line 43, the set F contains exactly all the hyperedges 
of E which generate the simple hyperedges leaving X mTL 1 : they are exactly characterized 
by 

Find(z, H) = X for all z £ T(e), and T(e) / { y } for all y £ C 
<^=^ Find(z,H) = X for all z £ T(e), and collected e = false 

since at that Line 43, a hyperedge e satisfies collected e = true if and only if T(e) is reduced 
to a singleton {t} such that index [t] is defined. 

Finally, for all y £ C, FiND(y, TL) can be equivalently replaced by Find(X,W). 

As a consequence, MinSccCount2(H) returns the same result as MinSccCount2(W). Be- 
sides, both functions perform the same union-find operations, except the first the node merging 
step executed by MinSccCount2(H) on C. 

Let / be the function which maps all nodes y £ C to X, and any other node to itself. We 
claim that TL' and fiTL) have the same reachability graph, i.e. ~»^/ and ~^>f(n) are identical 
relations. Indeed, the two hypergraphs only differ on the images of the hyperedges e £ E such 
that T{e) = {y} for some y £ C. For such hyperedges, we have H{e) C C, because otherwise, 
ismin[x] would have been set to false (i.e. the Sec C would not be minimal). It follows that 
their are mapped to the cycle ({X},{JT}) by /, so that TL' and fiTL) clearly have the same 
reachability graph. In particular, they have the same minimal Sees. 



1: function MinSccCount3(JV, E) 34: while F is not empty do 

2: n := 0, nb := 0, S := [], Finished := 35: pop e from F 

3: for all e G F do 36: for all wj 6 H(e) do 

4: r e := undef , c e := 37: local W := Find(mj) 

5: collectede := false 38: if low[W] = undef then HVisit(w) 

6: done 39: if W £ Finished then 

7: for all u £ N do 40: ismin[U] := false 

8: irtde:r[u] := undef, low[u] := undef 41: else 

9: Makeset(u), F„ := [] 42: Zotu[I/] := min(low[U], low[W]) 

10: done 43: ismin[U] := ismm[(7] &&: ismin[W] 

11: for all u £ TV do 44: end 

12: if index[u] = undef then HVisit3(m) 45: done 

13: done 46: done 

14: return nb 47: if low[U] = index[U] then 

15: end 48: if ismin\U] = true then 

49: local i := index[U] 

16: function Visit3(m) 50: pop each e £ Fjy and push it on F 

17: local U := Find(u), local F := [] 51: pop V from S 

18: index[U] := n, low[U] := n, n := n + 1 52: while index[V] > i do 

19: ismm[[/] := true, push 1/ on the stack S 53: pop each eeFy and push it on F 

20: for all ee£„do 54: U := Merge(E/, V) 

21: if |T(e)| = 1 then push e on F 55: pop V from 5 

22: else 56: done 

23: if r e = undef then r e := u 57: index[U] := i, push U on S 

24: local R e := FlND(r e ) „ J _ I collectede. = false 

25: if i? e appears in 5 then °' V :_ \ 6 I and Vx £ T(e), FlND(x) = U 

26: c e := c e + 1 59: for all e £ F do collectede ■= true 

27: if c e = |T(e)| then 60: if F ^ then go to Line 34 

28: push e on the stack Fr^ 61: nfe := n& + 1 

29: end 62: end 

30: end 63: repeat 

31: end 64: pop V from S, add V to Finished 

32: done 65: until index[V] = index[U] 

33: for all e S F do collectede := true 66: end 

67: end 



Fig. 12. Second intermediary form of our linear algorithm on hypergraphs 

Finally, since the elements of C are in a same Sec of TC, Prop. 7 shows that the function / 
induces a one-to-one correspondence between the Sees of H and the Sees of f(H). 

D /(D) 

(D'\{I})UC< — i £>' ifleD' 
D' < — i I?' otherwise. 

The action of the function / exactly corresponds to the node merging step performed on C. 
Since by induction hypothesis, MinSccCount2(W) determines the minimal Sees in f(H), it 
follows that Prop. 11 holds. □ 

The second intermediary version of our algorithm, MinSccCount3, is based on the first one, 
but it performs the same computations on the auxiliary data r e and c e as in MinSccCount. 
However, the latter are never used, because at Line 58, F is re-assigned to the value provided 
in MinSccCount2. It follows that for now, the parts in gray can be ignored. The following 
lemma states that MinSccCount2 and MinSccCount3 are equivalent: 

Proposition 12. Let H be a hypergraph. Then MinSccCount3(H) returns the number of 
minimal Sec ofTL. Besides, the minimal Sees are formed by the sets { v G N \ Find(w) = 
U and ismin[U] = true}. 

Proof. When HVisit3(w) is executed, the local stack F is not directly assigned to the set 
{ e G E | T(e) = { u} } (see Line 18 in Fig. 11), but built by several iterations on the set E u 
(Line 21). Since u £ T(e) and |T(e)| = 1 holds if and only if T(e) is reduced to { u }, HVisit3(m) 
initially fills F with the same hyperedges as HVisit2(m). 



Besides, the condition no.merge = false in HVisit2 (Line 45) is replaced by F 7^ (Line 60). 
We claim that the condition F 7^ can be safely used in HVisit2 as well. Indeed, in HVisit2, 
F 7^ implies no. merge = false. Conversely, suppose that in HVisit2, no .merge = false and 
F = 0, so that the algorithm goes back to Line 47 after having no -merge to true. The loop from 
Lines 20 to 32 is not executed since F = 0, and it directly leads to a new execution of Lines 33 
to 45 with no .merge = true. Therefore, going back to Line 47 was useless. 

Finally, during the node merging step in HVisit3, n keeps its value, which is greater than 
or equal to i + but is not necessarily equal to i + 1 like in HVisit2 (just after Line 46). This 
is safe because the whole algorithm only need that n take increasing values, and not necessarily 
consecutive ones. 

We conclude by applying Prop. 11. □ 

We make similar assumptions on the atomicity of the sequences of instructions. Note that 
Inv. 1, 2, and 3 still holds in HVisit3. 

Invariant 4. Let e G E such that \T(e)\ > 1. If for all x G T(e), index[x] is defined, then the 
root r e is defined. 

Proof. For all x G T(e), HVisit3(x) has been called. The root r e has necessarily been defined at 
the first of these calls (remember that the block from Lines 17 to 33 is supposed to be executed 
atomically) . □ 

Invariant 5. Consider a state cur of the algorithm in which U G Finished. Then any node 
reachable from U in G(TL cur ) is also in Finished. 

Proof. The invariant clearly holds when U is placed in Finished. Using the atomicity assump- 
tions, the call to HVisit3(u) is necessarily terminated. Let old be the state of the algorithm 
at that point, and Hold and Finished id the corresponding hypergraph and set of terminated 
nodes at that state respectively. Since HVisit3(u) has performed a depth-first search from the 
node U in G(7i id), all the nodes reachable from U in Ti id stand in Finished id- 

We claim that the invariant is then preserved by the following node merging steps. The 
graph edges which may be added by the latter leave nodes in S, and consequently not from 
elements in Finished (by Inv. 2). It follows that the set of reachable nodes from elements of 
Finished id is not changed by future node merging steps. As a result, all the nodes reachable 
from U in G(7i cur ) are elements of Finished y. Since by Inv. 5, Finished id C Finished, this 
proves the whole invariant in the state cur. □ 

Invariant 6. In the digraph G(7i cur ), at the call to HVisit3(u), u is reachable from a node W 
such that index[W] is defined if and only if W belongs to the stack S. 

Proof. The "if" part can be shown by induction. When the function HVisit3(u) is called from 
Line 12, the stack S is empty, so that this is obvious. Otherwise, it is called from Line 38 during 
the execution of HVisit3(x). Then X = Find(x) is reachable from any node in the stack, since 
x was itself reachable from any node in the stack at the call to Find(X) (inductive hypothesis) 
and that this reachability property is preserved by potential node merging steps (Prop. 7). As 
u is obviously reachable from X, this shows the statement. 

Conversely, suppose that index[W] is defined, and W is not in the stack. According to Inv. 2, 
W is necessarily an element of Finished. Hence u also belongs to Finished by Inv. 5, which is a 
contradiction since this can not hold at the call to HVisit(u). □ 

Invariant 7. Let e G E such that |T(e)| > 1. Consider a state cur of the algorithm MinScc- 
Count3 in which r e is defined. 

Then c e is equal to the number of elements x G T(e) such that index[x] is defined and 
Find(x) is reachable from FlND(r e ) in G(H CUT ). 



Proof. Since at Line 26, c e is incremented only if R e = FlND(r e ) belongs to S, we already know 
using Inv. 6 that c e is equal to the number of elements x G T(e) such that, at the call to 
HVisit3(x), x was reachable from FlND(r e ). 

Now, let x G N , and consider a state cur of the algorithm in which r e and index [x] are 
both defined, and FiND(r e ) appears in the stack S. Since index[x] is defined, HVisit3 has been 
called on x, and let old be the state of the algorithm at that point. Let us denote by T~C id 
and TC C ur the current hypergraphs at the states old and cur respectively. Like previously, we 
may add a hypergraph as last argument of the function Find to distinguish its execution in 
the states old and cur. We claim that FiND(r e , T~C C ur) ^G(H C ur) Find(x, Ti C ur) if and only if 
FlND(r e , 'Hold) ^G{H id) x - T ne "if" P ar t i s due to the fact that reachability in G(Ti, id) is not 
altered by the node merging steps (Prop. 7). Conversely, if x is not reachable from FiND(r e , Hold) 
in H id, tnen FlND(r e , H id) is not in the call stack S id (Invariant 6), so that it is an element of 
Finished id. But Finished u C Finished cur , which contradicts our assumption since by Inv. 2, 
an element can not be stored in Finished cur and S cur at the same time. It follows that if r e is 
defined and FlND(r e ) appears in the stack S, c e is equal to the number of elements x G T(e) 
such that index[x] is defined and FiND(r e ) ~^g(W cut .) Find(x). 

Let cur be the state of the algorithm when FiND(r e ) is moved from S to Finished. The invari- 
ant still holds. Besides, in the future states new , c e is not incremented because FiND(r e , 7~Lcur) G 
Finishedcur C Finished new (Inv. 3), so that FiND(r e , Tt n ew) = FiND(r e , H C ur), and the lat- 
ter can not appear in the stack S new (Inv. 2). Furthermore, any node reachable from R e = 
FiND(r e ,Tt new ) in G(H n ew) belongs to Finishednew (Inv. 5). It even belongs to Finished CU r, as 
shown in the second part of the proof of Inv. 5 (emphasized sentence) . It follows that the number 
of reachable nodes from FiND(r e ) has not changed between states cur and new. Therefore, the 
invariant on c e will be preserved, which completes the proof. □ 

Proposition 13. In HVisit3, the assignment at Line 58 does not change the value of F. 

Proof. It can be shown by strong induction on the number p of times that this line has been 
executed. Suppose that we are currently at Line 49, and let X\, . . . , X q be the elements of the 
stack located above the root U = X± of the minimal Sec of G(TL cur ). Any edge e which will 
transferred to F from Line 49 to Line 56 satisfies c e = |T(e)| > 1 and FiND(r e ) = Xi for some 
1 < i < q (since at 49, F is initially empty). Invariant 7 implies that for all elements x G T(e), 
Find(x) is reachable from Xi in G(7i. cur ), so that by minimality of the Sec C = { Xi, . . . , X q }, 
Find(x) belongs to C, i.e. there exists jl such that Find(x) = Xj. It follows that at Line 56, 
Find(x) = U for all x G T(e). Then, we claim that collected e = false at Line 56. Indeed, e' G E 
satisfies collected e > = true if and only 

- either it has been copied to F at Line 21, in which case |T(e')| = 1, 

— or if it has been copied to F at the r-th execution of Line 58, with r < p. By induction 
hypothesis, this means that e' has been pushed on a stack Fx and then popped from it 
strictly before the r-th execution of Line 58. 

Observe that a given hyperedge can be popped from a stack F x at most once during the whole 
execution of MinSccCount3. Here, e has been popped from Fx t after the p-th execution of 
Line 58, and \T(e)\ > 1. It follows that collected e = false. 

Conversely, suppose for that, at Line 58, collected e = false, and all the x G T(e) satisfies 
Find(x) = U. Clearly, |T(e)| > 1 (otherwise, e would have been placed into F at Line 21 and 
collected e would be equal to true). Few steps before, at Line 49, Find(x) is equal to one of Xj, 
1 < j < q- Since index[Xj] is defined (Xj is an element of the stack S), by Inv. 1, index[x] is 
also defined for all x G T(e), hence, the root r e is defined by Inv. 4. Besides, FiND(r e ) is equal 
to one of the Xj, say X^ (since r e G T(e)). As all the Find(x) are reachable from FiND(r e ) 
in G(H cur ), then c e = \T(e)\ using Invariant 7. It follows that e has been pushed on the stack 
Fr b , where R e = Fmr>(r e ,H id) m an previous state old of the algorithm. As collected e = false, 



e has not been popped from Fr b , and consequently, the node R e of TC id has not involved in 
a node merging step. Therefore, R e is still equal to FlND(()r e , T~C C ur) = X^. It follows that at 
Line 49, e is stored in Fx k , and thus it is copied to F between Lines 49 and 56. This completes 
the proof. □ 

We now can prove the correctness of MinSccCount. By Proposition 13, Line 58 can be 
safely removed in HVisit3. It follows that the booleans collected e are now useless, to that 
Lines 5, 33, and 59 can be also removed. After that, we precisely obtain the algorithm MinScc- 
Count. Proposition 12 completes the proof. □ 



D.2 Complexity Proof 

Then analysis of the time complexity MinSccCount depends on the kind of the instructions. 
We distinguish: 

(i) the operations on the global stacks F u and on the local stacks F, 

(ii) the call to the functions Find, Merge, and MakeSet, 

(hi) and the other operations, referred to as usual operations. By extension, their time com- 
plexity will be referred to as usual complexity. 

Also note that the function H Visit (u) is executed exactly once for each u £ N during the 
execution of MinSccCount. 



Operations on the Stacks F and F u . Each operation on the stack (pop or push) is in O(l). A 
given hyperedge is pushed on a stack of the form F u at most once during the whole execution 
of MinSccCount. Once it is popped from it, it will never be pushed on a stack of the form Fy 
again. Similarly, a hyperedge is pushed on a local stack F at most once, and after it is popped 
from it, it will never be pushed on any local stack F' in the following states. Therefore, the total 
number of stack operations on the local and global stacks F and F u is bounded to by 4 \N\. It 
follows that the corresponding complexity is 0(\N\). 

As a consequence, the total of the number of iterations of the loop from Lines 53 to 62 which 
occur during the whole execution of MinSccCount is bounded to J2 e eE \H(e)\. 

Union-find Operations. During the whole execution of MinSccCount, the function Find is 
called: 

- exactly \N\ times at Line 35, 

- at most Ylu^N l-^wl = J2e£E l^( e )l times at Line 42 (since during the call to HVisit(u), the 
loop from Lines 38 to 50 has exactly \E U \ iterations), 

- at most J2e£E \ H( e )\ at Line 54 (see above). 

Hence it is called at most size(H) times. 

The function Merge is always called to merge two distinct nodes. Let C\, . . . ,C P (p < 
| TV |) be the equivalence classes formed by the elements of N at the end of the execution of 
MinSccCount. Then Merge has been called at most J2i=i(\ c i\ ~ x )- Since J2i\ c i\ = \ N \> 
Merge is executed at most \N\ — 1 times. 

Finally, MakeSet is called exactly \N\ times. It follows that the total time complexity of 
the operations MakeSet, Find and Merge is 0(size(H) x a(|iV|). 

Usual Operations. The analysis of the usual complexity is split into several parts: 

- the usual complexity MinSccCount without the calls to the function HVisit is clearly 
0(\N\ + \E\). 



— during the execution of HVisit(u), the usual complexity of the block between Lines 35 and 50 
is 0(1) + 0{\E U \). Indeed, we suppose that the test at Line 43 can be performed in 0(1) by 
assuming that the stack S is provided with an auxiliary array of booleans which determines, 
for each element of N, whether it is stored in S. 4 Then the total usual complexity between 
Lines 35 and 50 is 0(s\ze(H)) for a whole execution of MinSccCount. 

— the usual complexity of the body of loop from Lines 53 to 62, without the recursive calls 
to HVisit, is clearly O(l). As mentioned above, the total number of iterations of this loop 
is less than J2etE \H( e )\ — s\ze(H). Therefore, the total usual complexity of the loop from 
Lines 51 to 63 is in 0(s\zeH). 

— the usual complexity of the loop between Lines 69 and 73 for a whole execution of MinSc- 
cCount is 0(|iV|), since in total, it is iterated exactly the number of times the function 
Merge is called. 

— the usual complexity of the loop between Lines 78 and so for a whole execution of MinScc- 
Count is 0(|iV|), because a given element is placed at most once into Finished. 

— if the two previous loops are not considered, less than 10 usual operations are executed in 
the block from Lines 64 to 82, all of complexity 0(1). The execution of this block either 
follows a call to HVisit or the execution of the goto statement (at Line 75). The latter is 
executed only if the stack F is not empty. Since each hyperedge can be pushed on a local 
stack F and then popped from it only once, it is executed \E\ in the worst case during the 
whole execution of MinSccCount. It follows that the usual complexity of the block from 
Lines 64 to 82 is 0(\N\ + \E\) in total (excluding the loops previously discussed). 

Total Time Complexity. Summing all the complexities above proves that the time complexity 



E An Example of an Execution Trace of the Algorithm MinSccCount 

In this section, we give the main steps of the execution of MinSccCount on the hypergraph 
depicted in Fig. 8: 



Nodes are depicted by solid circles if their index is defined, and by dashed circles otherwise. 
Once a node is placed into Finished, it is depicted in gray. Similarly, a hyperedge which has 
never been placed into a local stack F is represented by dotted lines. Once it is pushed into 
F, it becomes solid, and when it is popped from F, it is colored in gray (note that for sake of 
readability, gray hyperedges mapped to cycles after a node merging step will be removed) . The 
stack F which is mentioned always corresponds to the stack local to the last non-terminated 
call of the function Visit. 

Initially, Find(jz) = z for all z G { u, v, w,x,y,t}. We suppose that HVisit(u) is called first. 
After the execution of the block from Lines 35 to 50, the current state is: 



of MinSccCount is 0(size(W) x a(|7V|). 



□ 





4 Obviously, the push and pop operations on the stack S are still in O(l) under this assumption. 



Following the hyperedge e±, HVisit(u) is called during the execution of the block from Lines 51 
to 63 of HVisit(u). After Line 50 in HVisit(u), the root of the hyperedge is set to v, and the 
counter c e , is incremented to 1 since v £ S. The state is: 



index[v] — 1 
low[v] — 1 
ismin[v] — true 

r ei = v ' ' x " t 

index[u] = Vy/. c e4 = 1 ^ S= [v; u] 

low\u\ = (u\ \ n = 2 

.eh 



true^h. ,',„\ , y\ F - [e 2 



' w — ..* V ) 



nb = 



Similarly, HVisit(w) is called during the execution of the loop from Lines 51 to 63 of HVisit(w). 
After Line 50 in HVisit(u;), the root of the hyperedge es is set to w, and the counter c es is 
incremented to 1 since w 6 S. Besides, the c e4 is incremented to 2 = |T(e4)| since FiND(r e4 ) = 
Find(v) = v G S, so that is pushed on the stack F v . The state is: 



index [u] 
lowlu] 




index[w] = 2 [ t 
low [w] = 2 



1 



S = [w; w; u] 

n = 3 

F = [ C3 ] 
F„ = [e 4 ] 
nb = 



The execution of the loop from Lines 51 to 63 of HVisit(w) discovers that index [u\ is defined 
but u Finished, so that low[w] is set to mm(low[w], low[u\) = and ismin[w] to zsmm[iu] &fc 
ismin[u] = true. At the end of the loop, the state is therefore: 



index [u] 
low [u] 
isminlu] 



index[v] = 1 
low[v] — 1 
ismin[v] — true 




index[w] = 2 
low [w] = 
ismin[«j] = irue 



s = 


[w; v; u 


n = 


3 


F = 


[] 


F„ = 


N 


n& = 






Since towfw] ^ index[w], the block from Lines 64 to 82 is not executed, and HViSiT(tt;) termi- 
nates. Back to the loop from Lines 51 to 63 in HVisit(v), Zowfu] is set to mm(low[v], low[w]) = 0, 
and ismin[v] to ismin[v] &fc ismin[w] = true: 



index[v] — 1 
low[v] — 
ismin[v] — true 



index [u] 
lowlu] 




index[w] = 2 
low [w] = 
ismmfwl = true 



s = 


[w; v; u 


n = 


3 


F = 


[] 


F„ = 


[e 4 ] 


n& = 






Since Iow[d] 7^ meter [u], the block from Lines 64 to 82 is not executed, and HViSiT(f ) terminates. 
Back to the loop from Lines 51 to 63 in HVisit(u), Zoiufu] is set to min(/ow;[ti], Zowff]) = 0, and 
ismin[u] to ismin[u] &fe ismin[v] = true. Therefore, at Line 64, the conditions /ou;[ii] = index[u] 
and ismin[u] = true hold, so that a node merging step is executed. At that point, the stack F 
is empty. After that, i is set to index[u] = (Line 66), and F u = [] is emptied to F (Line 67), so 
that F is still empty. Then w is popped from S, and since index[w] = 2 > i = 0, the loop from 
Lines 69 to 73 is iterated. Then the stack F w = [] is emptied in F. At Line 71, Merge(u, w) is 
called. The result is denoted by U (in practice, either U = u or U = w). The state is: 



:[V] = 1 

low[v] = 

isminlv] = true „ r -, 

11 _ S = [v; u\ 



" ' F„ = [e 4 ] 

••^.-s i = 



/••-[; 

inrfes [[/J = or 2 i s r< , =w U = Find(w) = Find(w) 

low[U]=0 it}cll = l nb = 

ismin[U] = true 



Then v is popped from S, and since index[v\ = 1 > i = 0, the loop Lines 69 to 73 is iterated 
again. Then the stack F u = [ej is emptied in F. At Line 71, MERGE(L r , v) is called. The result 
is set to U (in practice, U is one of the nodes u, v, w). The state is: 



index[U] = 0, 1, or 2 

Zow[[7] = 
ismin[U] = irue 




/ s ' e 
l t ) C e 



= w 

= 1 



5=M 
n = 3 

^ = [] 
i = 

F = [C4] 

[/ = Find(m) = Find(w) = Find(w) 
n6 = 



After that, ix is popped from S, and as index [u] = = i, the loop is terminated. At Line 74, 
index[U] is set to i, and U is pushed on S. Since F / 0, we go back to Line 51, in the state: 

S = [U] 

index[U] = n = 3 

Zow[C/]=0 ([/J F=[e 4 ] 
ismin [f/j = true^ — J U = Find(w) = Find(w) = Find(w) 

nb = 




Then e4 is popped from F, and the loop from 53 to 62 iterates over H(e±) = {x,y}. Suppose 
that x is treated first. Then Visit(x) is called. During its execution, at Line 50, the state is: 



index[U] = 

low[U] = { / 
ismin[U] = true 



index[x] — 3 
low[x] — 3 
ismin[x] — true 




S = [x; U] 
n — 4 
F=[] 

U = Find(w) 
nb = 



Find(w) = Find(w) 



Since F is empty, the loop from Lines 51 to 63 is not executed. At Line 64, Iow[x] = index [x] and 
ismin[x] = true, so that a trivial node merging step is performed, only on x, since it is the top 
element of S. At Line 74, it can be verified that S = [x; U], index[x] = 3 and F = []. Therefore, 
the goto statement at Line 75 is not executed, and nb is incremented at Line 76. It follows that 
the loop from Lines 78 to 80 is executed, and after that, the state is: 



index[U] = 

low[U] =0 |f 
ismin[U] = true 



index[x] = 3 
low[x] = 3 
ismin\x] = true 




S=[U] 
n = 4 
F=[] 

U = Find(w) = Find(w) = Find(w) 
nb = 1 
Finished = { x } 



After the termination of HVlSlT(a;), since x £ Finished, ismin[U] is set to false. After that, 
HViSiT(y) is called, and at Line 50, it can be checked that c e5 has been incremented to 2 = |T(e5)| 
because R e5 = FiND(r e5 ) = Find(w) = U and U G S. Therefore, e§ is pushed to Fjj, and the 
state is: 



index[U] = 
low[U\ = 
ismin[U] = false 




= w 
= 2 



index[y] = 4 
low[y] — 4 
ismin[y] — true 



S=[y,U] 
n = 5 
F=[] 
Fu = [e B ] 
U = Find(m) = Find(w) = Find(w) 
nb = 1 
Finished = { x } 



As for the node x, HVisit(?/) terminates by incrementing nb, popping y from S and adding it 
to Finished. Back to the execution of HVisit(J7), at Line 64, the state is: 



index[x] — 3 
low[x] — 3 
isminlx] = true 



index[U] = 
low[U] = 
ismin[U] = false 




index[y] — 4 
low[y] — 4 
ismin[y] — true 



S=[U] 
n = 5 
F=[] 
Fu = [e B ] 
U = Find(m) = Find(u) = Find(w) 
nb = 2 
Finished = {y,x} 



While Zoioff/] = index[U], ismin[U] is equal to false, so that no node merging loop is performed 
on U. Therefore, e§ is not popped from Fjj and nb is not incremented. Nevertheless, the loop 
from Lines 78 to 80 is executed, and after that, HVisit(u) is terminated in the state: 



index[x] — 3 
low[x] — 3 
ismin[x] — true 



index[U] = 
low[U] = 
ismin[U] = false 




index[y] — 4 
low[y] — 4 
ismin[y] — true 



S=[] 
n = 5 

F=[] 
Fu = [e B ] 
U = Find(u) = 
n& = 2 
Finished = {U,y,x} 



Find(w) = Find(w) 



Finally, HVisrr(i) is called from MinSccCount at Line 10. It can be verified that a trivial 
node merging loop is performed on t only, and that nb is incremented. After that, t is placed 
into Finished. Therefore, the final state of MinSccCount is: 

index[x] = 3 
low[x] — 3 
ismin[x] = true 



index[U] = 
low[U] = 
ismin[U] = false 




index [y] = 4 
low[y] — 4 
ismin[y] — true 



S=[] 
n = 6 
Fu = [e B ] 
17 = Find(m) = Find(u) = Find(w) 
nb = 3 
Finished = {t,U,y,x} 



index[t] = 5 
ioty[t] = 5 
ismin[t] = irtte 



Consequently, there is 3 minimal Sees in the hypergraph. As ismin[x] = ismin[y] = ismin[t] 
true and ismin[FmD(z)] = false for z = u, v, w, they are given by the sets: 

{z | Find(z) = x} = {x} 
{z\ FlND(z) =y} = {y} 

{z | Find(z) =t} = {t}. 



F Determining All Sees in a Hypergraph 

We just give an example of hypergraph for which our algorithm does not compute all the Sees, 
but only minimal ones: 




Our algorithm determines that { t } is the least Sec. However, u and x stand in the same Sec, 
but they are not merged by the algorithm. This is similar for v and y. 



